A Discrete Arabic Script for Better Automatic Document Understanding
نویسنده
چکیده
لمعتست ةديدج طوطخ ريوطتل ساسلأا ثحبلا اذه عضي ةلصفنم فورحب ةيبرعلا صوصنلا ةباتآ يف . طوطخلا هذه دعاست ىلع ،بتكلا ةعابط يف لمعتست نأ نكميو تادنتسملل يللآا مهفلا و ،دئارجلا و ىرخلأا داوملا لآ و ،تايرودلا ةعوبطملا . طوطخلا هذه لثم جاتنإ دنع يه امآ ىقبت ةيبرعلا ةباتكلل ىرخلأا صئاصخلا لآ نإف عبطلابو . و رورملاب ةلصفنم فورحب ةيبرعلا صوصنلا ةباتكل انتوعد معدت ةيوق ةجح مدقن هتأشن ذنم يبرعلا طخلل ثدح ام ىلع . انكمت دقل ةيسأر ءاضيب تاصقب فورحلا هذه لصف نكمي ثيحب ةلصفنم فورحب ةباتكلل ةديدج ةيبرع طوطخ ريوطت نم . دق و تمّت امب نيلماع ةسارد ةديدجلا تابلطتملا بساني : ارفلا اغ راسي ىلع ن فرحلا و هنيمي . دقو تاحفص عست انمدختسا A4 ذ اطخ نأ ةجيتنلا تناكف تاغارفلا هذهل ةبسانم ميق ىلإ لوصولل ا غارف هرادقم ١٦٠ راسي ىلع طوطخ ةدحو فرحلا هنيميو لداعي احاجن ققح دق 99.99% فورحلا لصف يف .
منابع مشابه
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملSkew Detection, Page Segmentation, and Script Classiication of Printed Document Images
Automatic processing of international documents presents a number of challenging problems because Optical Character Recognition (OCR) techniques are not available for all languages and all script classes. Document images must be categorized according to their script type rst, in our case Roman, Ideographic, or Arabic. We present a set of statistical methods that rst detect and correct the skew ...
متن کاملAutomatic script identification from images using cluster-based templates
We describe a system that automatically identifies the script used in documents stored electronically in image form. The system can learn to distinguish any number of scripts. It develops a set of representative symbols (templates) for each script by clustering textual symbols from a set of training documents and representing each cluster by its centroid. "Textual symbols" include discrete char...
متن کاملHandwritten Script recognition using Soft Computing
Today, handwritten script recognition is challenging part in the computer science. It is important to know a script used in writing. Script recognitions have many important applications like automatic transcription of multilingual documents, searching document image, script sorting. Proposed work emphasis on the “block level technique” where script recognition recognizes the script of the given...
متن کاملHandwritten Script Recognition Using DCT, Gabor Filter and Wavelet Features at Line Level
In a country like India where more number of scripts are in use, automatic identification of printed and handwritten script facilitates many important applications including sorting of document images and searching online archives of document images. In this paper, a multiple feature based approach is presented to identify the script type of the collection of handwritten documents. Eight popula...
متن کامل